Optimizing Resource Provisioning in Shared Cloud Systems

نویسندگان

  • Christina DELIMITROU
  • Christos KOZYRAKIS
چکیده

Cloud computing promises flexibility and high performance for users and cost efficiency for operators. To achieve this premise, cloud providers offer several provisioning strategies including long-term reserved resources and short-term ondemand resources. Determining the most appropriate provisioning strategy is a complex, multi-dimensional problem that depends on the load fluctuation, interference sensitivity and duration of incoming jobs and the performance unpredictability and cost of the provisioned resources. We first compare the two main provisioning strategies (reserved and on-demand resources) on Google Compute Engine (GCE) using three representative workload scenarios with mixes of batch and latency-critical applications and increasing levels of load variability. We show that either approach is suboptimal from the performance or cost perspective. We then explore hybrid provisioning strategies with both reserved and on-demand resources. We design policies that account for the resource preferences of incoming jobs to automatically determine which jobs should be mapped to reserved versus on-demand resources based on overall load, and resource unpredictability. We demonstrate that hybrid configurations improve both performance and cost-efficiency compared to fully reserved and fully on-demand systems. Specifically they improve performance by 2.1x compared to fully on-demand provisioning, and reduce cost by 46% compared to fully reserved systems. We also show that hybrid strategies are robust to variation in system and job parameters, such as cost, and system load.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A Debt-Aware Learning Approach for Resource Adaptations in Cloud Elasticity Management

Elasticity is a cloud property that enables applications and their execution systems to dynamically acquire and release shared computational resources on demand. Moreover, it unfolds the advantage of economies of scale in the cloud through a drop in the average costs of these shared resources. However, it is still an open challenge to achieve a perfect match between resource demand and provisio...

متن کامل

Multi-Agent Based Dynamic Resource Provisioning and Monitoring In Cloud Computing Systems

The cloud computing paradigm provides a shared pool of resources and services with different models delivered to the customers through the Internet via an on-demand dynamically-scalable form charged using a pay-per-use model. The main problem we tackle in this paper is to optimize the resource provisioning task by shortening the completion time for the customers’ tasks while minimizing the asso...

متن کامل

Application of Selective Algorithm for Effective Resource Provisioning in Cloud Computing Environment

Modern day continued demand for resource hungry services and applications in IT sector have led to development of Cloud computing. Cloud computing environment involves high cost infrastructure on one hand and need high scale computational resources on the other hand. These resources need to be provisioned (allocation and scheduling) to the end users in most efficient manner so that the tremendo...

متن کامل

A Survey on Resource Provisioning in Cloud

Cloud Computing allow the users to efficiently and dynamically provision computing resource to meet their IT needs. Companies are able to rent resources from cloud for storage and other computational purposes so that their infrastructure cost can be reduced. Further they can make use of company-wide access to applications based on pay-as-you-go model. Hence there is no need for getting licenses...

متن کامل

Optimizing resource topologies of workload in the cloud by minimizing consumption and maximizing utilization while still meeting service level agreements

Reducing the total cost of ownership (TCO) and meeting the service level agreements (SLA) of a computing service or application in the Cloud remains a challenge. The traditional method of meeting SLA is to size production systems based on expected peak workloads, thus this leads to very low compute resource utilization. Our experience shows typical average utilization rates of 35 40%, topped on...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2014